CDS

Accession Number TCMCG075C26802
gbkey CDS
Protein Id XP_007014211.2
Location 13028998..13030935
Gene LOC18589262
GeneID 18589262
Organism Theobroma cacao

Protein

Length 645aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007014149.2
Definition PREDICTED: uncharacterized protein LOC18589262 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category D
Description Belongs to the adaptor complexes medium subunit family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K10875        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03440        [VIEW IN KEGG]
map03440        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTCGTGCTTGGCACTTTCTCTCCAACCAGCCAACGGATCCGACATCCTTCTCCAAACCCGAGAATGGTTCCCTCCAGCCCGTGCCCTAGTTGCCCTCCATGCATTCCGCCAAACCCGTCTTGCCTTCTCCAACAAAAACCCTGCCTCCGCTGCCGCTTCCACCTCCGCCCCCTCATCGTCCTCCACCTCCGAATGTGATGCTGCCACTGAATCCATCGGCGACGACCCCCTTGCTGCCTCCAGCGGCCAATTAATCGTTGGCGTTGAAAGTAAGTATCGCGTCGTTTACCGCCTTGTGAACTCCATTTACGTCCTCGGAATCACTACCGCCGATCACGACAATTTGATCAACGTTTTCGAGTGCATCCACATAGTTAATCAAGCCGTTAGCGTCATCGTAACCGCCTGCCGTGGCGTGGACGTCACCCCCGAAAAACTCGCCCGCAAATATGCCGAGGTTTACATGGCACTCGACATTGTCCTGCGTGGAGTCAGCAACATCCGTCTCGCCGCCATGCTCTCCGCTATGCACGGCGACGGGATCGCCAAGATGGTCCATTCCGCACTAGACACCGAAGCCAAGATCCGTGGTGCTGACACGTGGCTGAACGTCGAAGCCCATTCGGTCGAACACCAATCCAACGTTGAAGCCTTTTCCAGTGCGAATTTCGAATTGCCACCAGAAACCCTAGCGGCGGGCGACCGGATAGCTTCAACGCTTGTACCTCAGAGTACAAGTGAGCAAGACGAGAAAATGGTTAAAGAAGAGAATTCAGAAGCCGTAAAGGATCCGTTCGCTGCGAGTGAATCCATAAATAAGCAAGAAGAGTTAGTTGGAGGGTTTAAGAAGACGAAGGATCCATCAGCTACTGATTTAACGGTGGCGTTGGCGGGGTTAGAGGTGACTACATTGCCTCCAGCTGAAGCAACCCAATCTACAGATATTACTGTTGAAGGGTTTGAAGGGAAGTATGGAGGTATTGAGTTTGGCAATGAACAGGCTACCCTTGGAGAAGCTTTTGAAGGGTTTAGTGATGCTTGGGGTGGAGGATTGGATGCTTCCGAGTTTTTGGAAAATAAAAAGGTTAAGAAACAGGAAGGACTTAGTGGGCTTGAACTTTTGCAAACTGGGGATAGTGCTGCTCCTCCAACTGCAGCTGCGGCTGGTGCTGATGGAGGAAAGTCTCTTGAGGATCTTTTGGTGAAGAAGACTGAGATGAAAGGTCCTGAAATGTATATTTCAGAGGAGATTAGTGCAGAGTTTAGGGAATCATTGCTTGCAAGAGTTGGATTAATGGGTGTGGTTTACTTGAGAACTATGCCTCCTAAAAATTCTGGTGATAAGGATGCTGAGTTTTCATTTCGTGTTGAAGGTACAAGTTCCGTTAAGAGGTTTGTTATGCAGAGTTCACGGGTTAGTAGCCTAGGTAATGGAATGTTTCATGTGAGAACTGCCCCATCTGAGGAGCCTATACCGATTTTGAAGTATAGTTTGTTACCTAGGTTGACACCATTGCCTTTGAGAATTAGGTTGATTAAACGTCAAAGTGGGACTTTACTTTCAGTAATGATACAGTATATTTCAAGCCCGGAGTTACTAGCACCATTGAATGATGTAACCTTTGTTCTGAAATTGCCAGTTGATCCAACATTGTTAAAGGTTTCGCCTAAAGCTGTGTTGAGCAGATCAGAGAGAGAATTGAAGTGGCATGTGCCGGAGATTCCACTGAAGGGTACACCTGGCAAGTTAAGAGTGAGGATGCCTGTGGATTCTAGTGAAGATGACGAAGACCTAGAAGTTGTTGGTTATGTAAAATTTTCAGTGCAAGGAGCTACTTCATTGTCTGGGGTCTGTCTGCGGGCTGCTTCTGAGGGTAAGACAGATTTTTATGAGGTGAATCATCGGTATGAGAGTGGTGTTTATATGTGCAATTGA
Protein:  
MSCLALSLQPANGSDILLQTREWFPPARALVALHAFRQTRLAFSNKNPASAAASTSAPSSSSTSECDAATESIGDDPLAASSGQLIVGVESKYRVVYRLVNSIYVLGITTADHDNLINVFECIHIVNQAVSVIVTACRGVDVTPEKLARKYAEVYMALDIVLRGVSNIRLAAMLSAMHGDGIAKMVHSALDTEAKIRGADTWLNVEAHSVEHQSNVEAFSSANFELPPETLAAGDRIASTLVPQSTSEQDEKMVKEENSEAVKDPFAASESINKQEELVGGFKKTKDPSATDLTVALAGLEVTTLPPAEATQSTDITVEGFEGKYGGIEFGNEQATLGEAFEGFSDAWGGGLDASEFLENKKVKKQEGLSGLELLQTGDSAAPPTAAAAGADGGKSLEDLLVKKTEMKGPEMYISEEISAEFRESLLARVGLMGVVYLRTMPPKNSGDKDAEFSFRVEGTSSVKRFVMQSSRVSSLGNGMFHVRTAPSEEPIPILKYSLLPRLTPLPLRIRLIKRQSGTLLSVMIQYISSPELLAPLNDVTFVLKLPVDPTLLKVSPKAVLSRSERELKWHVPEIPLKGTPGKLRVRMPVDSSEDDEDLEVVGYVKFSVQGATSLSGVCLRAASEGKTDFYEVNHRYESGVYMCN